Sharp Load Thresholds for Cuckoo Hashing 

Nikolaos Fountoulakis and Konstantinos Panagiotou* 

Max-Planck-Institute for Informatics 
Saarbriicken, Germany 



Abstract 

The paradigm of many choices has influenced significantly the design of efficient data 
structures and, most notably, hash tables. Cuckoo hashing is a technique that extends this 
concept. There, we are given a table with n locations, and we assume that each location 
can hold one item. Each item to be inserted chooses randomly k > 2 locations and has to 
be placed in any one of them. How much load can cuckoo hashing handle before collisions 
prevent the successful assignment of the available items to the chosen locations? Practical 
evaluations of this method have shown that one can allocate a number of elements that 
is a large proportion of the size of the table, being very close to 1 even for small values 
of k such as 4 or 5. 

In this paper we show that there is a critical value for this proportion: with high 
probability, when the amount of available items is below this value, then these can be 
allocated successfully, but when it exceeds this value, the allocation becomes impossible. 
We give explicitly for each k > 2 this critical value. This answers an open question posed 
by Mitzenmacher (ESA '09) and underpins theoretically the experimental results. Our 
proofs are based on the translation of the question into a hypergraph setting, and the 
study of the related typical properties of random fc-uniform hypcrgraphs. 



1 Introduction 

A fundamental data structure that is omnipresent in various computer science applications is 
the dictionary. A dictionary typically supports the following three basic operations: insertion 
and deletion of elements as well as membership queries. One way to design dictionaries is by 
using hash-based techniques, which become highly efficient when the power of multiple choices 
is exploited. In particular, if each item has the choice among several different locations in the 
table, and simultaneously the items may be moved dynamically to different locations, very 
efficient data structures that achieve small lookup times and high memory utilizations can 
be designed. In this work we will focus on a variation of this concept that is called Cuckoo 
Hashing. 

Cuckoo hashing was introduced by Pagh and Rodler in [23] . The general setting considered 
here is a slight variation of it, as defined by Fotakis, Pagh, Sanders and Spirakis in [15]. We 
are given a table with n locations, and we assume that each location can hold only one item. 
Further generalizations where two or more items can be stored have also been studied, see 
e.g. [mElE], but we will not treat those cases. Moreover, there is a set X of items, such that 
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each x E I is assigned k > 2 random distinct locations ^i(a;), . . . , ik( x )- This assumption may 
be somehow idealized, as exponentially many bits would be needed to store such fully random 
sets of locations. However, there is theoretical evidence that even "simple" hash functions 
can be sufficient in practice, provided that the underlying data stream fulfills certain natural 
conditions; we refer the reader to the papers [2T] by Mitzenmacher and Vadhan and [TU] by 
Dietzfelbinger and Schellbach, and the references therein. 

A natural question in cuckoo hashing is the following. As the number of available items 
increases, it becomes more and more unlikely that all of them can be inserted into the table 
such that each item is assigned to one of its k desired locations. In other words, if \2\ 
is "small", then with high probability, i.e., with probability arbitrarily close to 1 when n 
becomes large, there is such an assignment to the locations in the table that respects the k 
choices of the items. On the other hand, if \2\ becomes "large", then such an assignment 
does not exist with high probability (trivially, this happens at the latest when n + 1 items 
are available). The important question is whether there is a critical size for I where the 
probability for the existence of a valid assignment drops instantaneously in the limiting case 
from 1 to 0, i.e., whether there is a load threshold for cuckoo hashing. This question is at 
the core of this article. More precisely, we shall say that a value c* k is the load threshold for 
cuckoo hashing with k choices for each element if 



It is not a priori clear whether such a critical value c£ exists at all. However, as a warm-up 
example let us consider the case k = 2, that is, each item has two preferred random locations. 
Then there is a natural analogy to random graphs. Indeed, let us think of the n locations as 
the vertices of the graph, and of the items as edges, which encode the two choices for each item. 
What we obtain in this way is the Erdos-Renyi random (multi-)graph G nm , where m = \I\. 
Moreover, it easy to see by applying Hall's Theorem that G n ^ m has no subgraph with more 
edges than vertices if and only if the corresponding items can be assigned to the corresponding 
locations such that the choices of all items are respected. It is well-known that the property 
"Gn jm has a subgraph with more edges than vertices" coincides with the appearance of a 
giant connected component that contains a linear fraction of the vertices, see e.g. the detailed 
book |17j . As the latter is known to happen around m = n/2, we readily obtain that the load 
threshold for cuckoo hashing and k = 2 is at c\ = 1/2. Particularly, roughly half of the table 
can be filled, while respecting the choices of all items. 

In the case k > 3 things get significantly more complex, and there are almost no precise 
results about the existence and the value of the corresponding load threshold c* k (see the next 
section for a detailed account on the related work and previously known bounds). However, 
experiments in [15] suggest that e.g. C3 « 0.91, c\ « 0.97 and c\ 0.99. In other words, three 
choices allow already an utilization of 91% of the table, and e.g. five choices are sufficient 
to fill the table almost completely. The main result of this work is the following theorem, 
which says that for all k > 3 the load threshold c£ exists, and determines its value. This in 
particular answers a question by Mitzenmacher [20] related to the maximum load that can 
be achieved when k > 3 choices are allowed for the elements. 

Theorem 1.1. For any integer k > 3 let £* be the unique solution of the equation 
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there is an assignment of \X\ = \cn\ items to a table 
with n locations that respects the choices of all items 
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Then c£ = fc ^ e _^,^ k _ 1 is the load threshold for cuckoo hashing with k choices per element. 
In particular, if there are [era] items, then the following holds with probability 1 — o(l). 

1. If c < c* k , then there is an assignment of the items to a table with n locations that 
respects the choices of all items. 

2. If c > c* k , then such an assignment does not exist. 

Numerically we obtain for example that C3 = 0.917, cj = 0.976 and = 0.992, where "=" 
indicates that the values are truncated to the last digit shown. Note that this matches in all 
cases the empirically observed values from |15| up to the second decimal digit. Moreover, a 
simple calculation reveals that c£ = 1 — e~ k + o(e~ k ) for k — > 00, i.e., the utilization rate of 
the table approaches one exponentially fast in k. 

In the remainder of this section we first briefly survey some known results about cuckoo 
hashing in various settings, and we refer the reader to the survey [20J by Mitzenmacher, which 
contains a much more complete treatment. Then we give a high-level overview of our proofs, 
and point out the novelties in our approach and the differences to past approaches. 

Related Results A breakthrough in hashing with multiple choices was achieved by Azar, 
Broder, Karlin, and Upfal [2], who studied the following balls-into-bins game. Suppose that 
we sequentially place n balls into n bins such that for each ball we choose randomly two bins, 
and place the ball into the least full. Then this yields with high probability an exponential 
improvement regarding the maximum number of balls in any bin, compared to the game 
where each ball is just placed into a random bin. Particularly, in [2] it was shown that the 
maximum load grows like '"f^f " , which in most practical scenarios is very small. A variant 
of this scheme was later considered by Vocking [21], who showed even smaller upper bounds 
for the resulting maximum load. 

Cuckoo hashing is a further development of the above idea [23]. There, the items not only 
have multiple choices, but are also allowed to change their current location on demand. This 
might of course lead to a sequence of (recursive) movements of the items, until the newly 
arrived and all preceding items are inserted successfully into the table, if this was possible. 

The case of k = 2 choices is theoretically well- understood. As already mentioned, there 
is a natural correspondence to random graphs, which makes the analysis tractable; we refer 
the reader to [8] and [12] for further details. However, the cases k > 3 are not that well- 
understood. 

Generally we can distinguish between two different settings of cuckoo hashing: the online 
case, where items are inserted or deleted over time from the data structure, and the offline 
case, where all required items are available already in the beginning. This paper is devoted 
to the study of the offline setting, and the important question in this context is about the 
maximum number of initially available items that can be inserted in the table with high 
probability, so that the choices of all items are respected (this is precisely the load threshold 
as defined above in (jl.ip ). The work [3] by Batu, Berenbrink and Cooper considers, among 
other things, the case k = 4, and derives an explicit expression for c\, which matches the 
value in Theorem 11.11 Moreover, they derive upper bounds for the cases k = 3 and k = 5. 
Moreover, (non-tight) lower bounds on c* k for large k have been achieved with the use of 
matrix techniques, see the paper [9] by Dietzfelbinger and Pagh. 

In the online setting many different approaches have been explored, see e.g. |11 [ 115 1 fT5] . 
Particularly, the popular "random-walk" insertion algorithm has attracted much attention, 
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and Frieze, Melsted and Mitzenmacher [16] showed at most polylogarithmic insertion time 
for sufficiently large k and appropriate (but not optimal) maximum loads of the table. We 
refer the reader to [2UJ for a more detailed treatment. 

Finally, a different line of research in cuckoo hashing aims at a deamortization of the 
time needed for the insertion of the elements. Note that while a membership query can be 
answered by checking at most k locations in the table, an insertion procedure (like e.g. the 
random-walk algorithm) could take significantly longer time. The papers [T] and [19] contain 
a very detailed treatment of this topic, and we refer the reader to them. 

Proof Methods Typically, but not exclusively, cuckoo hashing problems are modeled with 
a bipartite graph: the parts represent the locations and the items, respectively, and the edges 
represent the choices of the items. It is easy to see that the existence of a matching in this 
graph that covers all item-nodes is equivalent to the existence of an assignment of all items to 
the locations such that the choices are respected. In other words, we are seeking the threshold 
for the existence of such a matching. 

Our approach is also based on random graphs, but we adopt a different viewpoint. Par- 
ticularly, we model cuckoo hashing with random (uniform) hyper graphs, similarly to the 
case k = 2. More precisely, the n vertices represent the locations, and each available item 
corresponds to a hyperedge that contains k vertices. It is not very difficult to see (see also 
Section [2j that the existence of an assignment of all items to the locations such that the 
choices are respected is guaranteed to exist, if the underlying hypergraph does not contain 
any subgraph with more edges than vertices. Otherwise, an assignment is not possible. 

Unfortunately, unlike in the case k = 2, studying the emergence of a giant component is not 
sufficient. In fact, it turns out that the giant component, shortly after it appears in a random 
/s-uniform hypergraph, has significantly less edges than vertices. What we are interested in 
here is the threshold for the appearance of a dense subgraph, i.e., a subgraph that contains 
at least as many edges as vertices. An upper bound for this threshold can be obtained by 
considering the so-called core, which is the maximum subgraph that has minimum degree at 
least two. By using well-known results by Cooper [6] and Kim [18] we infer in Theorem 12.21 
that the sought threshold is at most c* k . The proof can be found in Section [3.11 

The main contribution of our work is the corresponding lower bound, i.e., to show that 
whenever c < c* k there are no dense subgraphs in the random £>uniform hypergraph with n 
vertices and [cn\ edges. This was shown already by Bohman and Kim [4] for the case k = 4. 
However, most of their proof is tailored specifically to this case, and does not extend to the 
whole range k > 3. Our proof is heavily based on the use of the probabilistic method and 
tools from large deviation theory (see the book [13] by Ellis and [7] by Dembo and Zeitouni), 
and boils down to a delicate maximization of a function that involves entropic terms as well 
as terms that arise from various probability estimates that we perform. We refer the reader 
to Section 14.11 for further details. 

2 Proof of the Main Result 

As already mentioned, we model cuckoo hashing using random hypergraphs: the n vertices 
represent the locations, and each available item is a hyperedge that contains k vertices. Let 
us become a little more precise. Let H* m k denote a hypergraph that is drawn uniformly at 
random from the set of multigraphs with n vertices and m edges, where each edge contains k 
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vertices. Then, an instance of H* m k corresponds precisely to a cuckoo hashing scenario, 
where the table consists of n locations, there are m items in total, and each items chooses k 
random locations as specified above. 

Note that if* k may have multiple edges, which happens when two items choose the 
same k locations. It will be more convenient to work on a slightly different random graph 
model, which we denote by H n ^ mk . There, multiple edges are forbidden, i.e., iT n ,m,fc is a 
hypergraph drawn uniformly at random from the set of all simple hypergraphs with n vertices 
and m edges. The next proposition says that the two models are essentially equivalent. 

Proposition 2.1. Let V be a property of simple hypergraphs. Then, for any c > and k > 3 

IP (H n , [cnl , k £P)=P (K,lanl,k €V)(1 + O(l)). 

Proof. The expected number of pairs of edges that are incident to the same set of vertices 

in H n,[cni,k is at mOSt ( m ) 2 • (fe) • (fc) _2 = Tllis ilT1 P lieS that H n,[cn\,k is sim P le with 

probability 1 — o(l), and the statement follows immediately. □ 

In what follows we will be referring to a hyperedge of size k as a k-edge and we will be 
calling a hypergraph where all of its hyperedges are of size k a k-graph. 

Suppose now that we are given an empty table with n locations, and m items together 
with k choices for each one of them. Note that an assignment of the items to the locations 
of the table such that every item gets assigned to one of its preferred locations is possible, if 
for the corresponding /c-graph H the following condition is satisfied: every one of the induced 
subgraphs of H has less /c-edges than vertices. Indeed, every one of the edges corresponds to 
an item that has been allocated to precisely one location in the table, or equivalently, to one 
vertex of the subgraph. But the existence of more /c-edges than vertices within an induced 
subgraph implies that the number of items that have been allocated to a certain set of table 
locations exceeds the number of admissible locations - clearly this is impossible. We call 
the ratio of the number of edges of a /c-graph over its number of vertices the density of this 
/c-graph. 

It turns out that this necessary condition is also sufficient - see the proof of Theorem 11.11 
below. In other words, the crucial parameter that determines whether an assignment of the 
items to the locations of the table is possible is the maximal density of an induced subgraph. 
The next theorem says that if the number of items exceeds c k n, then there is a subgraph with 
density > 1. Before we state it, let us introduce some additional notation. We define the core 
of a hypergraph H to be the maximum subgraph of H that has minimum degree at least 2; 
if such a subgraph does not exist then we say that the core is empty. The core of random 
hypergraphs and its structural characteristics have been studied quite extensively in recent 
years - see for example the papers by Cooper [6] or Molloy [22]. 

Theorem 2.2. Let c* k be defined as in Theorem \l.l\ If c > ct, then with probability 1 — o(l) 
the core ofH ncni ^ has density greater than 1. 

This theorem is not very difficult to prove, given the results in [6] and |18| . However, it 
requires some technical work, which is accomplished in Section [3l The heart of this paper is 
devoted to the "subcritical" case, where we show that the above result is essentially tight. 

Theorem 2.3. Let c* k be defined as in Theorem If c < ct, then with probability 1 — o(l) 
all subgraphs of H n ,cn,k have density smaller than 1. 
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The proof of this theorem is technically more challenging and it is spread over the remain- 
ing sections. With Theorems 12.21 and 12.31 at hand we are in a position to give the proof of our 
main result about cuckoo hashing. 

Proof of Theorem \l.l[ Let us construct an auxiliary bipartite graph B = (I,L;E), where I 
represents the m items, L represents the n locations, and £ E if £ is one of the k 

preferred locations for item i. Note that it is possible to assign all items to locations such 
that each item is assigned to one of its preferred locations if and only if there is a matching 
in B that covers all vertices in I. By Hall's Theorem such a matching exists if and only if for 
all I' C / we have that \I'\ < \T(I')\, where T(I') denotes the set of neighbors of the vertices 
in I' inside L. 

As a next step, let us describe more precisely the quantity r(/'). If we think of cuckoo 
hashing in terms of the corresponding random hypergraph H n ^ m ^, then r(J') is precisely the 
set of vertices that are contained in the hyperedges that correspond to the items in So, 
if c < c£, Theorem 12.31 guarantees that with high probability for all /' we have |r(/')| > \I'\, 
and therefore a matching exists. On the other hand, if c > c* k , then there is with high 
probability an /' such that |r(/')| < |/'|; choose for example /' to be the set of items that 
correspond to the edges in the core of -ff n ,m,fc- Hence a matching does not exist in this case, 
and the proof is completed. □ 

3 Properties of random /c-graphs 

The aim of this section is to determine the value c* k and to prove Theorem 12.21 Moreover, we 
will introduce some known facts and tools that will turn out to be very useful in the study of 
random hypergraphs, and will be used later on in the proof of Theorem 12.31 as well. 

(More) Models of random A;-graphs 

For the sake of convenience we will carry out our calculations in the H n ^ p ^ model of ran- 
dom fc-graphs. This is the "higher-dimensional" analogue of the well-studied G n , p model, 
where each possible (2-)edge is included independently with probability p. More precisely, 
given n > k vertices we obtain H n ^ p ^ by including each &-tuple of vertices with probability p, 
independently of every other /c-tuple. 

Standard arguments show that if we adjust p suitably, then the -ff n ,p,fc model is essentially 
equivalent to the -ff n ,cn,fc model. Roughly speaking, if we set p = c/c/(^~}), then H n ^ p ^ 
is expected to have p(T) = cn edges. In fact, much more is true. Let V be a convex 
hypergraph property, that is, whenever we have three hypergraphs on the same vertex set 
H±, H2, H3 such that Hi C H2 C #3 and Hi,H^ E "P, then H2 £ V as well. (We also assume 
that V is closed under automorphisms.) Clearly any monotone property is also convex and, 
therefore, in particular the properties examined in Theorem 12.31 The following proposition is 
a generalization of Proposition 1.15 from [17^ p. 16] and its proof is very similar to the proof 
of that - so we omit it. 

Proposition 3.1. LetV be a convex property of hypergraphs, and letp = cfc/(t~i)> where c > 
0. IfF(H ntPjk £ V) —> 1 as n -> 00, then P (H n ^ k £ V) -> 1 as well. 
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Working on the core of H npk — the Poisson cloning model 

Recall that the core of a hypergraph is its maximum subgraph that has minimum degree at 
least 2. At this point we introduce the main tool for our analysis of the core of H np j,- This 
is the so-called Poisson cloning model that was introduced by Kim |18j and was used for a 
variety of problems. Our treatment here was inspired by the analysis of Bohman and Kim [5] 
in the context of Achlioptas processes. 

The Poisson cloning model H n ^ p ^ for /c-graphs with n vertices and parameter p S [0, 1] is 
defined as follows. Consider a set of n vertices V n and consider also a family, indexed by this 
set, of i.i.d. Poisson random variables with parameter A := p^Zi) ■ For each v 6 V n let d(v) 
denote the corresponding random variable from this family. Then H npk is constructed in 
three steps as follows. First, for every v E V n the degree of v is a random variable and 
equals d(v). Second, for each such v we generate d(v) copies, which we call v -clones or simply 
clones, and choose uniformly at random a matching from all perfect A;-matchings on the set 
of all clones. Note that such a matching may not exist - in this case we choose a random 
matching that leaves less than k clones unmatched. Finally, we generate -ff n ,p,fc by contracting 
the clones to vertices, i.e., by projecting the clones of v to v itself for every v G V n . 

Note that the last two steps in the above procedure are together equivalent to the config- 
uration model Hd h-> where d = (d±, . . . , d n ), for random hypergraphs with degree sequence d. 
In other words, -ffd,fc is a random multigraph where the zth vertex has degree dj. 

The following statement is implied by [181 Theorem 1.1], and says that the study of H np ^ 
may be reduced to the study of the Poisson cloning model. 

Theorem 3.2. IfF^H ntPtk £ V\ -> as n -> oo, then ¥ (H n ^ k £ V) -> as well. 

The next result that we shall exploit gives a precise description of the core of H n „^. 
Particularly, Theorem 6.2 in [18] implies the following statement, where we write "x ± y" for 
the interval of numbers (x — y, x + y). 

Theorem 3.3. Let A2 := mm x> o ^ 1 _ e x x ^-i . Moreover, let x, be the largest solution of the 

equation x = (l — e~ xck ) k ~ 1 , and set £ := xck. Assume that ck = p(jlZi) = ^2+°"> where a > 
is fixed. Then, for any < 5 < 1 the following is true with probability 1 — o(l). If denotes 
the number of vertices in the core of H n p k , then 

fi2 = (1 — e - ^ — £e~^)n ± 5n. 

Furthermore, the core itself is distributed like the Poisson cloning model on fi2 vertices, where 
the Poisson random variables are conditioned on being at least two and have parameter A c ^, 
where A C)k = £ + (3, for some \(3\ < 5. 

In what follows, we say that a random variable is a 2-truncated Poisson variable, if it is 
distributed like a Poisson variable, conditioned on being at least two. We immediately obtain 
the following corollary. 

Corollary 3.4. Let n<i and m.2 denote the number of vertices and edges in the core of H n p ^. 
Then, for any < 5 < 1, with probability 1 — o(l), 

no = (1 — e~^ — £e~^)n ± 5n and = —£(1 — e~^)n ± 5n, 

k 

where £ = xck and x is the largest solution of the equation x = (1 — e _xcfc ) fc_1 . The same is 
true for the quantities hi and m,2 °f Hn,p,k- 
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Proof. The statement about 712 follows immediately from the first part of Theorem 13.31 to- 
gether with Theorem l3.2i The second part is not substantially more difficult. We will use the 
statement about the distribution of the core of H n ^ p ^ in Theorem [331 that is, the 2-truncated 
Poisson cloning model, and we transfer this result to -ff n ,p,fc using again Theorem 13.21 

Let us condition on certain values of and A c & that lie in the intervals stated in The- 
orem 13.31 Then the total degree of the core of -ff n ,p,fc is the sum of independent 2-truncated 
Poisson random variables d\, . . . , dn 2 with parameters in A Ci fc G £ ± 5. Set D := Y17=i ^i- A 
simple calculation shows that Var(D) = 0(77-2)- Therefore, Chebyschev's inequality yields 

P (\D — E (D)| > n^ 3 ) = O (nj V3 ) . 

Also, E (_D) = (1 ± c5) • 1 " for some appropriate c > 0. Therefore, by averaging 

over all choices of 77,2 in the interval stated in Theorem 13.31 we obtain that 777-2 = tj£(1 ~~ 
e~^)n ± dSn, for some d > 0. The proof completes by choosing the initial 5 as 5/d, and by 
applying Theorem 13.21 □ 



3.1 Proof of Theorem 12.21 and the Value of c* k 

In this section we will prove Theorem l2.21 i.e., we will show that the core of H np k has density 
at least one if p = ck/ ClZi) anc ^ c > c fc- Let < 5 < 1, and denote by 772 and 7772 the number 
of vertices and edges in the core of H n>p> f.. By applying Corollary 13.41 we obtain that with 
probability 1 — o(l) 

ri2 = (1 — — £e~^)n ± 5n and rri2 = — e _ ^)n ± <5n, 

where £ = xc/c and x is the largest solution of the equation x = (1 — e _a:cfc ) fc_1 . The value 
of c£ is then obtained by taking 772 = 777-2, and ignoring the additive error terms. The above 
values imply that the critical £* is given by the equation 



1 - e"€* - £*e"«* ' 

This is precisely (|1 .2|) . So, A; determines £* and x satisfies x = (1 — e-^)^- 1 = (1 — ) fc ~ 1 . 
Therefore, the critical density is 

1 

c * := jfc(i- e -^)fc-i- (3 - 1} 

Proof of Theorem \2.SX The above calculations imply that with probability 1 — o(l), for any 
< 5 < 1 

772 fc 1 — e * — £e * 
Moreover, if c = ct, then 7712/71-2 = 1 =t 25. To complete the proof it is therefore sufficient to 
show that 777-2/772 is an increasing function of c. Note that the ratio is the expected 

value of a 2-truncated Poisson random variable with parameter £, which is known (and easily 
seen) to be increasing in £. Recall that £ = xck. We conclude the proof by showing the 
following claim. 
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Claim 3.5. The quantity £ = xck is increasing with respect to c. So, with probability 1 — o(l) 

m 2 -. ., * ,"12-,., * 

— < 1 , if c < c k ana — > 1 , if c > c k . 
Indeed, recall that x satisfies x = (1 — e xcfc ) fc_1 . Equivalently, xck = ck(l — e xck ) k ~ 1 . We have 

ck = (i _ e -C)fe-i' ( - 3 ' 2 ^ 

An easy calculation shows that the derivative of the function on the right-hand side has no 
positive roots: 

' _ (1 - e-t) k ~ 2 (1 - e"* + e-*£(k - 1)) «>o) 
= (i _ e -S)2(fe-i) > °' 



That is, the function is strictly increasing for £ > and, therefore, when ck 

increases, then the root of (|3.2jl . that is, the product xc/c, increases as well. □ 

4 Proof of Theorem [231 

Let us begin with introducing some notation. For a hypergraph H we will denote by Vh its 
vertex set and by Eh its set of edges. Additionally, vh and en shall denote the number of 
elements in the corresponding sets. For U C Vh we denote by vjj, ejj the number of vertices 
in U and the number of edges joining vertices only in U. Finally, djj is the total degree in U, 
i.e., the sum of the degrees in H of all vertices in U. 

We say that a subset U of the vertex set of a hypergraph is 1-dense, if it has density at 
least one, i.e., the subgraph induced by U has at least as many edges as vertices. Working 
towards the proof of Theorem 12.31 we begin with showing that whenever c < 1, H niCn j~ does 
not contain large 1-dense subsets, for any k > 3. In particular, we will first argue about sets 
with no more than 0.7n vertices; for them it is sufficient to use a first moment argument that 
is based on rough counting. For larger sets of vertices we need more sophisticated arguments 
regarding the structure of the core of H n>cn; j- - we present those in the next subsection. 

The following statement deals with the case k > 5; is not best possible, but it suffices for 
our purposes. 

Lemma 4.1. Let k > 5, c < 1. Then H ncn f. contains no 1-dense subset with less than 0.7n 
vertices with probability 1 — o(l). 

Proof. The probability that an edge of -ff n ,cn,fc is contained completely in a subset U of the 
vertex set is ('£')/(*) < (^) fc . Let J < u < 0.7n. Then 

n \ I cn \ kun n (2H{u)+ku\nu) 



F (3 1-dense subset with un vertices) < • u < e 

\un ) \un J 

where H(x) = — xlnx — (1 — x)kix denotes the entropy function. Note that the second 
derivative of the exponent in the expression above is k ~^^ , which is positive for x G (0, 1). 
Hence the exponent is convex, implying that it is maximized either at u = k/n or at u = 0.7. 
Note that 

2#(0.7) + jfc0.71n(0.7) < 2#(0.7) + 5 • 0.71n(0.7) < -0.02 
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and that 

\n J n \n J n \n 

So, the minimum is obtained at u = k/n, and we conclude the proof with 

P (3 1-dense subset with < 0.7n vertices) < V 0(n~ fc2+2fc ) < 0(re~ 14 ). 

k/n<u<0.7 



□ 



We shall need the following stronger claim for random 3-uniform hypergraphs. The as- 
sumptions might look somehow artificial at this point, but will become very handy later on, 
cf. Proposition 14.31 

Lemma 4.2. Let H be a 3-graph, and call a set U C Vh bad if 

eu = \U\ and Ve e E H :\enU\ ^ 2. 

Then, for any c < 0.95 

P (-ff n ,cn,3 contains a bad subset U with < n/2 vertices) = o(l). 

Proof. Let p = d /("j 1 ) , where d = 3 • 0.95 < 2.85. A simple application of Stirling's formula 
reveals 

P(i^ n>P) 3 has exactly cn edges) = (1 + o(l))(27rcn) -1 / 2 . 

As the distribution of -ff n ,cn,3 is the same as the distribution of H n p 3 conditioned on the 
number of edges being precisely cn we infer that 

P (^ff n ,cn,3 contains a bad subset U with < n/2 vertices) 

= 0{n l l 2 ) ■ P (-ff n ,p,3 contains a bad subset U with < n/2 vertices) . 

To complete the proof it is therefore sufficient to show that the latter probability is o(n -1 / 2 ). 
We accomplish this in two steps. Note that if a subset U is bad, then certainly \U\ > 4. Let 
us begin with the case s := \U\ < n 1 / 4 . There are at most n s ways to choose such a U, and at 
most s 3s ways to choose the edges that are contained in U . Hence, the probability that H n ,p,3 
contains a bad subset with at most n 1 / 4 vertices is bounded for large n from above by 

n l/4 n l/4 / x S „l/4 „l/4 



s=4 




£- s s 3 V = E Ht^tt <£ n (1+3/4) ~ 2 -o(i) <E K 1/4+o(1) = n " 1+ ° (1) 



The case n 1//4 < \U\ < n/2 requires a more careful analysis. Let n~~ 3 / 4 < u < 1/2, and denote 
by H(x) = — x\nx — (1 — x) ln(l — x) the entropy function. For future reference let us note 
that 

'2d" 



d 



lnp = ln v cvl J = ln w J + 0(1/n) ' (4,1) 

By using the above notation, there are ( " ) < e nH ^ ways to select a C/ with un vertices. 
Moreover, the number of ways to choose the un edges that are completely contained in U is 

e[un) \ , I e(un) 




exp < un ln 
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Finally, the number of all possible edges with two endpoints in U and one endpoint in Vjj \ U 
is ( n ^)(l — u)n. By combining all these facts it readily follows that the probability P u 
that H n ,p,3 contains a bad subset U with un vertices is at most 




Using (|4.ip and the definition of p we obtain that uniformly for n 3 / 4 < u < 1/2 
Pu < exp L (h(u) + nln ~ c'y - cV(l - u)j + 9(1) j . 

The derivative of the exponent with respect to d is given by "^ 3+2c 3 " — , which is easily seen 
to be positive whenever d < 3 and u < 1/2. We infer that P u is maximized if we choose d = 
2.85. Moreover, an easy calculation reveals that the derivative of the exponent with respect 
to u equals ln(du(l — u)) + 3 — ln(3) — 2c'u(l — u). As the function ln(c'x(l — x)) is monotone 
increasing for < x < 1/2 and the function — 2dx(l — x) is monotone decreasing, there is 
at most one ra -3 / 4 < uq < 1/2 where the derivative of the exponent vanishes. Moreover, it 
is easily seen that for c = 2.85 the derivative of the exponent is negative at u = n -3 / 4 and 
positive at u = 1/2. Hence uq is a global minimum, and P u is maximized either at u = n~ 3 / 4 
or at u = 1/2. Elementary algebra then yields that the left point is the right choice, giving 

1/4 

the estimate P u = o(2~ n ), and the proof concludes by adding up this expression for all 
admissible n -3 / 4 < u < 1/2. □ 



4.1 Subgraphs of the 2-Core 

Our general proof strategy for Theorem 12.31 is as follows. Suppose that -ff n ,en,fc contains a 1- 
dense subset of vertices, and let U be such a minimal (with respect to the number of vertices) 
one. Then each edge in the subgraph induced by U is contained in at least two vertices of U, 
as otherwise we could remove a vertex of degree one or zero in order to obtain a 1-dense 
set V C U. This implies that the core of Pl n ,cn,k contains all minimal 1-dense subsets. In the 
remainder of this section we therefore focus on the analysis of the core of H n ^ cn ^, and we will 
show that with probability 1 — o(l) it does not contain any 1-dense subset. 

Proposition 13.11 and Theorem 13.21 guarantee that we can perform all calculations in the 
Poisson cloning model, i.e., it is sufficient to consider the core of H n>p> k, where p = ck/ ClZi)i 
and c = c* k — 7 < 1, where 7 > will be determined later. Let C = C{H n ^^) denote the core 
of H n p j,. For notational convenience we use the symbols ni and mi to denote the number of 
vertices and the number of edges, respectively, of C. Our proof then proceeds by exposing C 
in the following three stages. 

1. We will expose the values of and A c ^ and condition on the event that they obtain 
specific values within the ranges stated in Theorem 13.31 

2. We will expose the degrees d = (d±, . . . , d n2 ) of the vertices of the core, which, again 
according to Theorem l3.3l are distributed as i.i.d. 2-truncated Poisson random variables 
with parameter A c fi- 



ll 



3. We expose Hd t k, i-e., the edges between the clones that emerge from the degree sequence 
that was exposed in Stage 2. 



In order to obtain tight bounds for the probability that there are 1-dense subsets in the core 
of i?n )P) fc, we will exploit more sophisticated properties of such sets. In particular, we will use 
the following statement, which was observed by Bohman and Kim [I]. We present a proof for 
the sake of completeness. 

Proposition 4.3. Let H be a k-graph with density < 1 and let U be an inclusion maximal 
1-dense subset ofVn- Then e\j = vu and all edges e G Eh satisfy |e Pi U\ 7^ k — 1. 

Proof. If ejj > vu, then let U' = U U {v }, where v is any vertex in Vh \ U. Note that such a 
vertex always exists, as U 7^ Vh- Moreover, denote by d the degree of v in U, i.e., the number 
of edges in H that contain v and all other vertices only from U. Then 

eu' = eu + d > e v > 
vu' vu + 1 ~ vu + 1 ~ 

which contradicts the maximality of U. Similarly, if there was an edge e such that |e n U\ = 
k — 1, then we could construct a larger 1-dense subset of Vh by adding the vertex in e \ U 
to U. □ 

The following lemma bounds the probability that a given set of the core is maximal 1- 
dense, assuming that the degree sequence has been exposed. That is, the randomness is that 
of the 3rd stage described above. However, the calculations rely neither on the certain values 
of ri2 and A Cj fc which are determined by Theorem 13.31 nor on the distribution of the degrees 
which is obtained through the Poisson cloning model, but it is much more general. A similar 
statement was shown in [3] for the special case k = 4, and the proof is inspired from there. 



Lemma 4.4. Let k > 2, d = (d\, . . . , djy) be a degree sequence and U C {1, . . . , N}. More- 
over, set m = k~ l 5^i=i d% an d Q = {km)~ 1 ^iec/^«- Assume that m < N. If Bu denotes the 
event that U is an inclusion maximal 1-dense set of Hd t k, then 

P {Bu) < (1 + o(l))Vfrn~( ™ ) (2 k -k- l) m ~ |f/| • e - kmH{q) , 



where H(x) = — xlnx — (1 — x) ln(l — x) denotes the entropy function. 

Proof. Recall that -Hd.fc is obtained by beginning with di clones for each 1 < i < N and by 
choosing uniformly at random a perfect fe-matching on this set of clones. This is equivalent 
to throwing km balls into m bins such that every bin contains k balls. In order to estimate 
the probability for Bu assume that we color the kqm clones of the vertices in U with red, and 
the remaining k(l — q)m clones with blue. So, by applying Proposition 14.31 we are interested 
in the probability for the event that there are exactly \U\ bins with k red balls and no bin 
that contains exactly one blue ball. 

We estimate the above probability as follows. We begin by putting into each bin k 
black balls, labeled with the numbers l,...,k. Let K, = {1, . . . , k}, and let X±, . . . , X m be 
independent random sets such that for 1 < i < m 



V/C'C/C : W{Xi = K!) =q\ K \l 



\k-\K'\ 
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Note that is the binomial distribution Hm(k,q). We then recolor the balls in the zth 
bin that are in Xi with red, and all others with blue. So, the total number of red balls is 
X = Y^iLi \Xi\, and it follows that the number of ways to partition kqm red and k(l — q)m 
blue balls in m bins such that each bin has k balls is P (X = kqm) ■ 2 km . Consequently, overall 
there are 

Z = F {X = kqm) 2 km ■ (kqm)\ ■ (k{l - q)m)\ ■ (k\)~ m 

ways to throw kqm red and k(l — q)m blue balls in m bins such that each bin has k balls. Note 
that E (X) = kqm, and that X is distributed like Hm(km,q). A straightforward application 
of Stirling's Formula gives then 

P (X = kqm) = P (X = E (X)) = (1 + o(l))(2vrg(l - q)km)~ 1/2 . 

Let Rj be the number of X^s that contain j elements. Then, the probability that there are 
exactly \U\ bins with k red balls and no bin that contains exactly one blue ball is precisely 

P = P (X = kqm A R k = \U\ A = 0) 2 krn • (kqm)\ • (jfe(l - q)m)\ ■ {k\)~ m . 

Using this notation we may estimate 

P , 

F (Bu) = -<(! + o(l))V2m • P (X = kqm A R k = \U\ A R k ^ = 0) . (4.2) 
Let pj = F (\Xi\ = j) = (^)<7 J (1 — q) k ~i ■ Moreover, define the set of integer sequences 



A 

Then 



k-2 k-2 
b k _ 2 ) G N^ 1 : J2bj=m- \U\ and J^jbj = kqm - k\U\ 
3=0 j=0 



(6 ,...,6 fc _ 2 )e^ v u ' 3=0 



Now observe that the summand can be rewritten as 



m\ kqm( )k{ i- g) mf m-\U\ \tt^- 

\u\r 1 q) W---,h-J f} \j 



Thus, by using (|4.2p we infer that 



P (Bu) < (1 + o(l))v^L™ V 9m (l - g )*( 1 -9) m (2* — fc — l) m ~ |c/| • 5, 

where 

(6o,...,6 fc _a)e^ v u ' ' 7 i=o V / 



□ 
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As already mentioned, the above lemma gives us a bound on the probability that a subset 
of the core with a given number of vertices and total degree is maximal 1-dense, assuming 
that the degree sequence is given. Particularly, it exploits the randomness that is present in 
the 3rd stage of the exposure process defined in the beginning of Section 14.11 In order to 
show that the core contains no 1-dense subset, we will estimate the number of such subsets 
using the first two stages of the exposure strategy. 

For a positive integer t and age [0, 1] let X q> t = X Q)t (C) denote the number of subsets 
of C with t vertices and total degree d = [q ■ /c?n 2 J, where m 2 is the number of edges in C. 
(For the sake of the simplicity we will say that the number of vertices of a set is its size and its 
total degree is its degree. Moreover, we will omit writing "|_-J" from now on.) Note that X q j 
is a random variable that depends only on the outcomes of the first two stages of the exposure 
of the core. Let also xQ denote the subset of these sets that are maximal 1-dense. Then the 
following corollary is an immediate consequence of Markov's inequality. 

Corollary 4.5. Let U be an arbitrary set of vertices of C of size t and degree d = q ■ fcm 2; 
and let Bu be defined as in Lemma \4-4\ Then 

p { x q} > I X gtt )<X q , t ¥(Bu). 

Let 5 > 0. Working on the probability space of the first two stages of our exposure process, 
we will estimate the expected value of X q ^ and thereafter, using Markov's inequality, we will 
show that Xqj itself is with probability 1 — o(l) no more that its expectation multiplied by 
a polynomial factor. Recall that n 2 denotes the number of vertices in the core of H n , p ,k- By 
applying Theorem 13.31 we obtain that with probability 1 — o(l) 

n 2 = n(l - e" ? - £e" ? ) ± 5 2 n, and A Cifc = £ ± 5 2 , 

where £ = xck and x is the largest solution of x = (1 — e~~ xck ^~ 1 m As £ is increasing with 
respect to c (cf. Claim |33|) . there exists a 7 = j(5) > such that c = c* k — 7 and £ = £* — 5, 

where £* is the unique solution of k = ^l 6 ^ . Therefore 

n 2 = n(l - e" r - f*e~**) ± 0(6n), and A Cjfc = C - 5 ± 5 2 . 

We shall be using these facts without further reference. This is all we need from the ran- 
domness contained in the first stage of our exposure process. On the probability space of 
the second stage, we will condition on an event £, which informally says that the sum of 
degrees of the vertices does not deviate much from its expected value. More specifically, re- 
call that the degrees of the vertices of C are i.i.d. 2-truncated Poisson random variables with 
parameter A Cj ^. Therefore, the expected value of the sum of the degrees is equal to 

D ■= n 2 A ^ (1 fc ~ e / T Ac ' fc j A - = na(fc(l - e k 5) ± 9(5 2 )) (4.3) 

for some constant > 0. Now, let d\, . . . ,d n2 denote the random variables that are the 
degrees of the vertices of C, and let £ be the event "D = Y17=i di £ D ± n 2 / 3 ". Moreover, 
recall that m 2 denotes the number of edges in C, i.e., 1112 = D/k. As we observed in the proof 
of Corollary 13.41 Var(D) = 0(n 2 ) and, therefore, Chebyschev's inequality yields: 

P(£) = 1 - O (n~ 1/3 ) . (4.4) 
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Note that on the event £ 

m 2 = n 2 (l - e k 5 + 9(<5 2 )), where e k is given by — '- = k(l - e k 5 + Q(5 2 )). (4.5) 

es — £ — 1 

Let E\ be the event that conditional on £ we have X q t < n 2 E(X q j \ £)■ Markov's inequality 
immediately implies that P (£± \ £) > 1 — 1/n 2 . So, Corollary 14.51 yields the following. 

Corollary 4.6. Let U be an arbitrary set of vertices of C of size t and degree d = q ■ kmi, 
and let Bjj be defined as in Lemma \4-4\ Then 



X\ L > > 0) < n z E(X q , t | £)F(Bu)+o(l). 
Proof. We have 

P (x$ > o) < P (x$ > o I E x n £) + P {£[) + P {£) = P > I £ x n £ \ + o(l). 

Note that any event about given the value of X q j is independent of £, as this regards 
only the randomness in the 3rd stage of the exposure process. The proof finishes by applying 
Corollary [33J □ 

Let us proceed with the estimation of E(X (?j j | £). We will use the following theorem, 
which is a special case of Theorem II. 4. 1 in |13j . 

Theorem 4.7. Let X be a random variable taking real values and set c(t) = ln~E(e tx ), for 
any t > 0. For any z > we define L(z) = sup tgR {zi — c(t)}. If X±, . . . , X s are i.i.d. random 
variables distributed as X, then for s — > oo 

i=1 1 <z) < exp(-sinf{/(x) : x < z}{l + o(l))) . 



s 

The function I(z) is non-negative and convex. 

Remark By the large deviation theory the above bound is asymptotically (i.e., for large s) 
tight. See [13] or [7] for a more detailed treatment. 

With this theorem at hand we are ready to present our main tool for estimating E(X q j \ £). 

Lemma 4.8. Let T z be the unique solution of z = ^-e-^z-T e~ T z ' w ^ ere z > Moreover, let 
I{z) = z (\nT z - ln£) - In (e Tz - T z - l) + In (e 5 - £ - l) , (4.6) 



and set 1(2) := In 2 — 2 ln^ + ln(e^ — £ — 1). Then I(z) is continuous for all z > 2 and convex. 

It has a unique minimum at fi = = — e k°~ + ©(^ 2 )); where I(fi) = 0. 

Let X\, . . . ,X S be i.i.d. 2-truncated Poisson random variables with parameter A c ,k- Then, 
uniformly for any z such that 2 < z < /i, we have 

^ X% < z^j < exp (-sl(z)(l + o(l)) + 0(s5 2 )) , 
as s — > oo. 
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Proof. We shall first calculate c(t) = lnK(e tx ), where X is a random variable which follows 
a 2-truncated Poisson distribution with parameter £. Note that 

1 ~ ptlct- p £e* _£ p t_-> 

expfcW) = E(e«, = ^ £V«lJ- = 

Differentiating tz — c(t) with respect to t we obtain 



(tz - c(i))' = [tz - In 



e« - £ - 1 M 



e« et - £e* - 1 ef - £ - 1 e? et - £e* 

Setting T := £e* yields the compact form 

T{e T -l) 



{tz - c(t))' = z 



e T — T — 1 



As this suggests, it will be convenient to parameterize zt — c{t) in terms of T. 

Setting the derivate to 0, we obtain a unique T that solves the above and which we 
denote T z . The uniqueness of the solution for z > 2 follows from the fact that the function 
x(e x — l)/(e x — x — 1) is strictly increasing with respect to x and, as x approaches 0, it tends 
to 2. In other words, T z is the unique positive real number that satisfies 

Letting t z be such that T z = £ t e tz , we obtain: 

-c{t z ) = - ln(e T * - T z - 1) + ln(e* - £ - 1) 

and 

t z z = z(lnT z — ln£). 

The function — c(i) is concave with respect to t (cf. Proposition VILLI in [131 p. 229]); 
also adding the linear term zt does preserve concavity. So t z is the point where the unique 
maximum of zt — c(t) is attained over t € R. Therefore, 

I(z) = z{\nT z - ln£) - ln(e Tz — T z — 1) + ln(e 5 - £ - 1). 

For z = ^3737, we have T z = £ and therefore the above equality yields /(//) = 0. 

As far as 1(2) is concerned, note that strictly speaking this is not defined, as there is no 
positive solution of the equation 2 = T(e T — l)/(e T — T — 1). However, the function on the 
right-hand side converges to 2 as T — > from the right. So, we can express 1(2) as a limit: 

1(2) := lim (2\nT - ln(e T - T - 1)) - 21n£ + ln(e* - £ - 1). 
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But 



rp2 rp2 

lim ( 2 In T — \n(e T —T—l)) = lim In -= = lim In ■ 



t^o+ v v ;; T^0+ e T -T-l T-0+ ^ + ^ + ... 

= lim In -j = =ln2, 

i + if + ■•• 

and therefore 

1(2) = ln2-21n£ + ln(e f -£-l). 

All the above analysis was for a 2-truncated Poisson variable X with parameter £. In order to 
obtain the statement of the lemma, recall that A c f, = £=t<5 2 , and that the Xi's are 2-truncated 
Poisson variables with parameter A c f~- A straightforward argument by using Taylor's theorem 
then shows that c(t) : = lnE(e 1 ) = c(t) + 0(5 2 ), which holds uniformly for t in any bounded 
domain. Moreover, set I(z) := sup tgK {.zi — c(t)}, for all z > 2. Therefore uniformly for all 
z>2, I(z) =I(z) + 0(S 2 ). 

Also, according to Theorem |L7] the function I(z) is non-negative and convex on its domain. 
So if z < [i, then inf{/(x) : x < z} = I(z) and the second part of the lemma follows. □ 

The following lemma bounds the expected value of X q j. 

Lemma 4.9. Let t = (3ri2 and q > (3. Then, for any n 2 sufficiently large and 5 > sufficiently 
small 

E(X q , t | 8) < 2^ exp (-n 2 (l - 0)1 (^yf^) (1 + °(1)) + 0(n 2 8 2 ) 
where I(z) is given in (|4.6p . 

Proof. There are (™ 2 ) ways to select a set with t vertices. We shall next calculate the proba- 
bility that one of them has the claimed property, and the statement will follow from a simple 
union bound. Let U be a fixed subset of the vertex set of C that has size t and let d\, . . . ,dt 
denote the random variables that are degrees of the vertices in U. Thus, we want to estimate 
the probability of the event Yl\=i di = q • km 2 conditional on £. Let ck+i, ■ ■ ■ , d n , 2 denote the 
random variables that are the degrees of the vertices which do not belong to U; these are i.i.d. 
2-truncated Poisson variables with parameter A c j. = £ ± 5 2 . If £ is realized, then recall (|4.5p 
and note that 

Y!i=t+l d i km 2 -qkm 2 k(l - q)m 2 k(l - q) , , „, i2 u 

= n 2 -t = -JT^pW * TT (1 " ek5 + @{6 )] - 

Note that the last expression is at most fc(l - e k 5 + 9(<5 2 )) = f^-i + @(^ 2 ) = A* + @(<5 2 ). 
So, by applying Lemma [4^81 using the convexity of L(z), and the fact P {£) = 1 — o(l) the 
proof is completed. □ 

Corollary 14.61 along with Lemmas 14.41 and 14.91 yield the following estimate. 
Lemma 4.10. Let t = (3n 2 and q > 0, and let m 2 = (1 — e^S ± Q(5 2 ))n 2 . Then 

Xf} > 0) = o(l)+ 



(2 k -k- I)™ 2 "* exp y—km 2 H (q) - n 2 (l - (3)1 ( K - _ ) + 0(n 2 5 2 ) 
Moreover, when q < (3, the above probability is 0. 



17 



The rest of this section is devoted to the proof of the following statement. 

Lemma 4.11. Let m 2 = (1 — e^S ± Q(S 2 ))n 2 , where 5 > is sufficiently small. Then the 
following holds with probability 1 — o(l). For any t = Bn 2 where 0.7 < 8 < 1 — e^b/2 and 
any q such that 8 < q < 1 — 2<yl ~ k ^ we have xf$ = 0. 

The lower bound for q in the above lemma comes from the fact that otherwise the corre- 
sponding probability is zero, cf. Lemma 14.101 Moreover, the upper bound in the range of q 
stems from the fact that the average degree of the complement of a set with t vertices and 
total degree q ■ krri2 is at least two. More precisely, the total degree of the core satisfies for 
small 5 > 

2(1 - 8)n 2 2(1 - 8) 

km 2 > q ■ km 2 + 2(n 2 - t) => q < 1 - \ H> < 1 - , ■ 

km 2 k 

With the above result at hand we can finally complete the proof of Theorem 12.31 

Proof of Theorem \2.3l Recall that it is sufficient to show that the core C of -ff n ,p,fc contains 
with probability 1 — o(l) no maximal 1-dense subsets (see also the discussion in the beginning 
of Section |4*TT|) . where p = cA;/(^ 1 ). Note also that it is sufficient to argue about subsets of 
size up to, say, (1 — ekS/2)n 2 , where n 2 is the number of vertices in C, as (|4.5p implies for 
small 5 that all larger subsets have density smaller than 1. 

Let k > 5. By applying Lemma 14.11 we obtain that LL n ^ C n,k does not obtain any 1-dense 
set with less that 0.7n vertices, and the same is true for Hn^^, by Proposition 13.11 and 
Theorem 13.21 In particular, C does not contain such a subset, and it remains to show the 
claim for sets of size at least 0.7ra > 0.7n2- The proof is completed by applying Lemma 14.111 
as we can choose 5 > as small as we please. 

The case k = 3 requires slightly more work. Lemma 14.21 guarantees that C has no subset 
with < n/2 vertices that contains exactly as many edges as vertices, and there is no edge that 
contains precisely two vertices in that set. In other words, by using Proposition 14.31 C does 
not contain a maximal 1-dense set with n/2 vertices. However, we know that 

Pfe^* - 1) 

n 2 = (l _ e « _ £* e -£* ± 0(5))n, where 3 = ^ _ 1 _' 

Numerical calculations imply that n 2 > 0.63n for any 5 that is small enough. So, C does not 
contain any maximal 1-dense subset with less than n/2 < n 2 /(2 ■ 0.63) < 0.77n 2 vertices. In 
this case, the proof is completed again by applying Lemma 14.111 

Finally, the case k = 4 was treated in [4] . This completes the proof of Theorem 12.31 □ 



4.2 Proof of Lemma 14.111 

Let 

/(A q) := 2 H{[3) + (1 - 8) ln(2 fe - k - 1) - kH (q) - (1 - 8)1 



k{\ 



1-8 



By using Lemma 14.101 and the well-known bounds (" 2 ) < e n2H ^ and ("J 2 ) < ( n t 2 ) we infer 
that 

^ InP (ij; > 0) < f(8, z) + e k 5 {kH (q) - ln(2 fc - k - 1)) + 0{5 2 ). 
We will show the following. 
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Claim 4.12. There exists a C > such that for any small enough e > the following is 
true. Let 0.7 < /3 < 1 — e, and g as m Lemma \4-ll\ Then 

f(P,q)<-Ce + 0(5 2 ). 

This completes the proof of Lemma 14.111 as follows. We distinguish between the following 
cases. First, note that if 0.7 < (3 < 1 — v^, then the above claim yields for sufficiently 
small 5 > 

— InP (xj$ > o) < -C\/$ + 0(5) < -CVS/2. 

This implies that with probability 1 — e"^^™ 2 ) we have A^ = for all t = (3n2 and q. 

Finally, if 1 — y/5 < f3 < 1, then the above claim implies that /(/?, q) = 0{5 2 ). Moreover, 
by the monotonicity of the entropy function and q > (3 we have for sufficiently small 5 > 

kH (q) - ln(2 fc - k - 1) < fcfl"(0.99) - ln(2 fc - jfe - 1). 

A simple calculation and the fact H(0.99) < 0.06 show that the above expression is negative 
for all k > 3. This completes the proof also in this case. 

The rest of the paper is devoted to the proof of Claim 14. 121 We proceed as follows. We 
will fix arbitrarily a f3 and we will consider f((3,z) solely as a function of z. Then we will 
show that if go = qo((3) is a point inside the domain where df/dq = 0, then /(/?, go) < —C\E. 

Additionally, we will show that this holds for /(/?, (3) and / (^(3, 1 — 2 ^ fc ^ 
Bounding f((3,q) at its critical points 

Let (3 be fixed. We will evaluate f((3,q) at a point where the partial derivative with respect 
to q vanishes. To calculate the partial derivative with respect to g, we first need to determine 
the derivative of I(z) with respect to z. 

According to Lemma 14. 8\ I(z) = z (In T z — In £) — In (e Tz — T z — l) + In (e^ — £ — l) . So 
differentiating this with respect to z we obtain: 

r>(*\ i t i c, 1 dT * eTz ~ 1 dT z @$ 

I(z) = lnT z -lnt + z-—- eTz _ Tz _ i — = In^-ln^ (4.8) 

However, in the differentiation of / we need to differentiate I(k(l — g)/(l — (3)) with respect 
to g. Using (|4.8p . we obtain 



9/ 



■«) 



(ln^-lne), 



9g 1-/3 
where H q is the unique solution of the equation 

fc(l ~q) _ H q (e H " - 1 



1-/3 e H «-H g -l' 

(Observe that the choice of the range of q is such that the left-hand side of the above equation 
is at least 2. So, H q is well-defined.) Also, an elementary calculation shows that H'{q) = 

In (~q^j- All the above facts together yield the derivative of /(/?, q) with respect to g: 
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Therefore, if go is a critical point, that is, if 9 ^Q q ' q ^ 



= 0, then qo satisfies 

q=qo 



T ^-go v. ^(1 - g ) T (e T ° - 1) 

T = ? > where — j = "Tj; ™ 7- 4 - 9 ) 

go 1-/3 e 10 - T - 1 



At this point, we have the main tool that will allow us to evaluate f(/3, qo). We will use (14. 9p 
in order to eliminate To and express /(/3, go) solely as a function of go- 
Claim 4.13. For any given (3, if qo = qo(@) is a critical point of f{(5,q) with respect to q, 
then 

/(Ago) = 2#(/3) + (1 - 0) In + fc In (g ) ^ 

+ (1 - 0) (21n£ + In (1 - g ) - ln(g ) + ln(l - 0) - In (A;g - £(1 - /?))) . 
Proof of Claim. Firstly note that 

/(*^)-^^ln^-]n(^-ro-l) + ]n(^- € -l 



ln l^rJ" ln( 0_1)+ln v 

whence 

1-gcA , ^ /e T ° -T -l 



-d - /3)i (^f) = -Hi - qo ) m (i^) + (i - /3) m ( 

= -/c(l - g ) In (1 - g ) + k In (g ) - kq In (g ) + (1-/3) In 



e* - f - 1 
e T ° - T - 1 



£ - i - 1 



Also, the definition of the entropy function implies that 

-kH (g ) = kqo In (g ) + k(l - g ) In (1 - g ) . 

Thus 

-(1 - /?K ( ^il^ ) " ^ (qo) = k In (g ) + (1 - 0) In (^|^) . (4-11) 

Now we will express e T ° as a rational function of To and Zq. Setting z := k ^\_ q ^ and solving 
(14, 9p with respect to e T ° yields 

e T = z + Tp(z-l) 
z - T 

Therefore 



.To 



1 -Tn 



2 



+ T z - T - z + T - T 5 + T 2 _ T 2 



z - T z - T 

Note that 

ATI -g ) £(l-g ) (1 - q )(kq - f(l - 0)) 



z - T 



1-/3 90 (1 - P)qo 
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Thus we obtain: 

ln(e To - 1 - T ) = 2 In T - ln(z - T ) 

^2 In £ + 2 In (1 - q ) - 2 In (g ) - In (1 - g ) - In (fcgo " £(1 " /?)) 
+ In go + In ((1-/3)) 
=2 In £ + ln(l - go) - In (go) + ln(l - 0) - In (kq - £(1 - /3)) . 

Substituting this into (|4,lip and adding the remaining terms, we obtain (|4.10p . □ 

We will now treat go as a free variable lying in the interval where q lies into, and we 
will study /(/3, go) for a fixed /3 as a function of qo- In particular, we will show that for any 
fixed /3 in the domain of interest f(f3,qo) is increasing. Thereafter, we will evaluate /(/?, go) 
at the largest possible value that go can take, which is 1 — 2 ^Z , and show that this value is 
negative. 

Claim 4.14. For any k > 3 and for any (3 > 0.7 we have 

df{P, go) . Q 
<9go 

Proof of Claim. The partial derivative of /(/?, go) with respect to go is 
df(J3,qo) k 1-/3 1-/3 k(l-P) 



dq go l - go go kq - £(l - /3) 



Since go < 1 — 2 ^ fc ^ , we obtain 

fc l — go 2 

Also go > /3 and £ < A;. Therefore, 

fcgo - £(1 -P)>kp- fc(l -/3) = 2(3k-k = k{2(3 - 1). 
Substituting these bounds into the formula of d ^ q '^ yields: 
df(Ago) > A _ * ^ ^ 



<9g go 2 g 2/3-1 



2/3 2 2/3 — 1 - V 2 K 2 / 3 - !) 
But 

1 1-/3 

> 



2 fc(2/3-l)' 

as /c(2/3 - 1) > 2(1 - /3), which is equivalent to 2j3{k + 1) > k + 2, or, 2/3 > |±f = 1 + jjj. 
Since /3 > 0.7 and > 3, the above holds. □ 
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Now we proceed with setting q^ := 1 — 2 ^ fc ^ into f(P,qo) and obtain a function which 
depends only on (3, namely: 

h(j3) := 2H(J3) + (1 - P) In (%zJlzl\ + H n ^ 



+ (1 - 0) (21n£ + In (2(1 - /?) 2 ) - In (k - 2 + 2/3) - In (k - 2 + 2/3 - £(1 - /?))) . 

We will show that /i(/3) is a convex function with respect to /?. Thus to obtain an upper 
bound on h{(3) in the interval [0.7, 1 — e], it will be sufficient to evaluate h(0.7) and h(l — e). 

Claim 4.15. For all k > 3 and all 0.7 < (3 < 1 

d 2 K(5) 
dp 2 

Proof. The second derivative of h(P) is given by 

d 2 h(P) 2 4(1-/3) | 2 + g | (2 + e)fe 



d/? 2 /3 (k + 2/3-2) 2 fc- 2 + 2/3-6 + 6/3 (fc - 2 + 2/3 - 6 + £/3) 2 ' 

The bounds on /3 imply that 

A; - 2 + 2/3 - 6(1 - /3) (5> < _1) A; - 2 + 2/3 - (k - 1)(1 - /3) < fc/3. 

By substituting this bound into the expression for d 2 h(P) / dp 2 we obtain that 

d 2 h(P) 2 4(1-/3) 2 + 6 (2 + Qk 
dp 2 ~ p (k + 2/3 - 2) 2 kp (fc/3) 2 ' 

Elementary algebra yields that 

2 2 + f 2 + f (/3<i) 1 / 2 + 6\ K>*-1) 1 / A; + 1\ 2 
h — — - + — -r+ > - -2 + 2— — - > - -2 + 2— — >-. 

p kp kp 2 - p 1 fey /3 V k J - k 



Also for all /3 > 0.7 we have 

4(1-/3) 



< 



(A; + 2/3 -2) 2 - (fc - l) 2 " 

Therefore > § - > 0, for all k > 3. □ 

The above findings are summarized in the following corollary. 

Corollary 4.16. Let 0.7 < /3 < 1 — e. If qo is a critical point of f(P,q) in the interval 
k<q<l- then 

f(P,q )<max{h(0.7),h(l-e)}. 
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k 


3 


4 


5 


6 


h(0.7) 


-0.02 


-0.11 


-0.2 


-0.3 



Table 1: Upper bounds on the values of h(/3), where (3 = 0.7. 



Bounding f(f3,q) globally 

To conclude the proof of the lemma it suffices due to above arguments to show that for some 
C > 

f((3, /?), f(P, 1 - 2(1 - p)/k), h(0.7), h(l -e)< -Ce + 0(5 2 ) 
for all 0.7 < (3 < 1 - e. We begin with h(0.7) and h(l - e). 

Claim 4.17. For all h > 3 we have h(0.7) < -0.02. 

Proof. Firstly, we give explicitly the values of h(0.7) for 3 < k < 6, see Table [TJ For k > 7 
we will provide an appropriate upper bound for h(0.7). Note that 2 • 77(0.7) < 1.23 and 
2 • (1 — 0.7) = 0.6. Moreover, observe that 

2 k - k - 1 €>*-i 2 fc - fc - 1 , 

< -TZi— iT- (4-12) 



ef - C - 1 ~ e fe ~ 1 - fc ' 

Additionally, note that kin ( fc " 2 ^ 2/3 ) = fcln(^^) < -0.6, and (1 - /3)ln(2(l - (3) 2 ) 
0.31n(0.18) < -0.51. So, £ < A; implies that 

/ 9 fc — fc — 1 \ 

h(0.7) < 0.12 + 0.3 In f -g — - — -J + 0.61n^ - 0.3 In ((/c - 0.6)(fc - 0.6 - 0.3fc)) 

< 0.12 + 0.31n ( — ^ — —J + 0.6 In k- 0.3 In ((jfe - 0.6) (jfe -1)0.6) 
/ 2 fc — — 1 \ 

< 0.12 + 0.3 In ( — j-j — J + 0.6 In fc - 0.3 In (0.5/c 2 ) 

/2 k — k — 1\ /2 k — k — l 

= 0.12 + 0.3 In | - 0.31n(l/2) < 0.33 + 0.3 In 



The last expression is negative for k = 7, and it is easily verified that it is decreasing with 
respect to k. This completes the proof. □ 

Claim 4.18. There exists C\ > such that h(l — e) < —C\e. 

Proof. Note that h(l) = and that the left derivative of h(/3) at (3 = 1 does exist. The 
convexity of h((3) and the fact that h(0.7) < imply that the (left) derivative of h((3) at f3 = 1 
is positive. Therefore applying Taylor's Theorem on the left of (3 = 1 yields the claim. □ 

Claim 4.19. For any k > 3 there is a C2 > suc/i that for any 0.7 < (3 < 1 — e we have 
f((3,(3)<-C 2 e + 0(5 2 ). 

Proof. By Lemma 14.81 we have = I'(^) = and then we observe that 
T (k{l-(3) 



I(k) = I (/.(I + 0(5))) = 7( M ) + /'(m)0(5) + /"^)0(5 2 ) = 0(5 2 ). 

1-/3 
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So, 

f(0, P) = ~(k- 2)H(fi) + (1 - 0) In ( 2 fc - k - 1 ) 



Note that for any k > 3 this function is convex with respect to 0, as —H{0) is convex 
and the linear term that is added does not affect its convexity. Moreover, for = 1, we 
have /(l, 1) = 0. Since #(0.7) > 0.6, we have 



/(0.7, 0.7) < -(k - 2) • 0.6 + 0.3 In f 2* - fc - 1 

The derivative of this function with respect to k is —0.6 + 0-3 . A simple calculation 

shows that the second summand is less than 0.35 for all k > 3, implying that /(0.7, 0.7) is 
decreasing with respect to k. So, we may set k = 3, thus obtaining /(0.7, 0.7) < —0.6 + 
0.3 In 4 < —0.1. The above analysis along with the convexity of f(0,0) finally imply with 
Taylor's Theorem the claimed statement. □ 

Claim 4.20. For all k > 3 there is a C3 > such that for all < 1 — e 

f(0,l-2(l- 0)/k) < -C 3 s. 
Proof. Substituting 1 — 2(1 — (3)/k for q into the formula of / we obtain: 

/ (A 1 - 2 -^T 1 ) = 2H ^) + (1 - H2 k -k-l)-kH ( k ~ 2 k +2 ^ ) - (1 - 01(2). 

Note that for /3 = 1 the expression is equal to 0. To deduce the bound we are aiming to, we 
will show that in fact / (/?, 1 — 2(1 — /3)/k) is an increasing function with respect to 0. That 
is, we will show that its first derivative with respect to (3 is positive. Note that by Taylor's 
Theorem, this implies the claim. 
We get 

Of (A 1 - _ ^ /l^A _ _ , _ b f 2-2/M + 



5/3 V /9 / V fc -2 + 2/3 / 

Substituting for 7(2) the value given in Lemma 14.81 and we obtain after some simplifications 
Of fcl-ta 



—) _ ln f(k-2(l-P)Y(e^-^-l) 



d{3 V 2(3 2 (2 k - k - l)^ 2 

We will show that the fraction inside the logarithm is greater than 1. Note first that the 
function (k — 2(1 — f3))//3 is decreasing with respect to (3 - so we obtain a lower bound by 
setting = 1. Moreover, the function (e x — x — l)/x 2 = ^ + |j + fr + ''' is increasing. So, 
as £ > — 1 we infer that e > . All these bounds together yield that 

(fc _2(i_/3))v-e-i) > i / fe V - fc ^ (413) 



2/3 2 (2 fc - k - 1)£ 2 2 \k-lj 2 k -k-l 
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For k = 3 the right-hand side of the above inequality is (1/2) • (3/2) 2 • ((e 2 — 3)/4) > 1. 
Similarly, for k = 4 we obtain that it is (1/2) • (4/3) 2 • ((e 3 — 4)/ll) > 1 and an analogous 
calculation says that it also greater than one for k = 5. For k > 6 note that 

e fe_1 — k e k ~ l - k 1 /e\ fc 
2 fc - jfc - 1 - 2 k - k ~ e \2/ 

We may omit in (|4.13p the factor k/ (k — 1) as it is greater than 1, and the right-hand side is 
now bounded below by ^ (| ) k . But (e/2) 6 > 2e; this establishes the fact that the derivative 
of f ((3,1 — 2(1 — (3)/k) with respect to (3 is positive, for all k > 3. □ 
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